Extraction of Characters and Modifiers from Handwritten Gujarati Words

Authors

  • Chhaya Patel
  • Apurva Desai
Abstract

Research on Optical Character Recognition (OCR) for almost all Indian languages is scarce. Gujarati is one of the scripts for which very little OCR literature is available. This paper describes one important phase of OCR: segmentation of handwritten words into their basic components, namely basic characters, conjunct characters, and modifiers, which are essential for recognizing a word. The paper describes methods for identifying the zone boundaries of a word and for using those boundaries to segment the word into its subcomponents. Connected component labeling is applied to detect the subcomponents of a word, which can be dissected further if needed to obtain other subcomponents of the word. This is the first attempt to dissect handwritten Gujarati words into their subcomponents.
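
As a rough illustration of the connected component step mentioned in the abstract, the sketch below labels 8-connected foreground regions in a small binary image and sorts each region into a zone. It is not the authors' implementation: the abstract does not say how the zone boundaries are computed, so `upper_boundary` and `lower_boundary` are simply passed in, and the names `label_components`, `zone_of`, and `WORD`, the toy image, and the "mixed" rule for components that cross a boundary are all assumptions made for this example.

```python
from collections import deque


def label_components(image):
    """Find 8-connected foreground (1) regions of a binary image.

    Returns one bounding box (top, left, bottom, right) per component,
    in the order their first pixel is met in a row-major scan.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def zone_of(box, upper_boundary, lower_boundary):
    """Classify a component by its vertical extent (hypothetical rule).

    Rows [0, upper_boundary) are the upper zone, rows
    [upper_boundary, lower_boundary) the middle zone, and the rest the
    lower zone. A component that crosses a boundary is "mixed", i.e. a
    candidate for further dissection.
    """
    top, _, bottom, _ = box
    if bottom < upper_boundary:
        return "upper"
    if top >= lower_boundary:
        return "lower"
    if top >= upper_boundary and bottom < lower_boundary:
        return "middle"
    return "mixed"


# Toy 6x8 "word": rows 0-1 upper zone, rows 2-4 middle zone, row 5 lower zone.
WORD = [
    [0, 0, 0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 1, 0, 1],
    [1, 1, 0, 0, 1, 1, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 1],
    [0, 1, 0, 0, 0, 0, 0, 1],
]

BOXES = label_components(WORD)
ZONES = [zone_of(box, upper_boundary=2, lower_boundary=5) for box in BOXES]
```

In this toy image, a component that lies entirely above the upper boundary or below the lower boundary would be treated as a modifier candidate, while a "mixed" component is one that touches more than one zone and would need the further dissection the abstract refers to.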

Similar resources

A Structural Approach for Segmentation of Handwritten Hindi Text

This paper attempts to segment handwritten Hindi words. The problem of segmentation is compounded by the possible presence of modifiers (matras) on all sides of the basic characters and by the uncertainty that different writing styles introduce into the character shapes. We have devised a structural approach to capture the similarities and differences between structure class...

Structural Feature Extraction to recognize some of the Offline isolated Handwritten Gujarati Characters using Decision Tree Classifier

A large amount of information exists on paper, and in an era of digital technology this information needs to be stored in electronic format. Using a scanner, this information can be digitized. Any later modification, such as adding, editing, removing, or searching, requires a technique or methodology that identifies text in an image and converts it into ASCII or Unicode. This paper pr...

Gujarati handwritten numeral optical character reorganization through neural network

This paper deals with an optical character recognition (OCR) system for handwritten Gujarati numbers. Considerable work exists for Indian languages such as Hindi, Kannada, Tamil, Bangla, Malayalam, and Gurumukhi, but hardly any work can be traced for Gujarati, especially for handwritten characters. In this work, a neural network is proposed for Gujarati handwritten digit...

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications, including reading aids for the blind, bank cheque processing, and conversion of handwritten documents into structural text form. A Neural Network (NN), with its inherent learning ability, offers promising solutions for handwritten characte...

Skew Detection and Correction for Gujarati Printed and Handwritten Character using Linear Regression

In this paper, we propose an approach for skew detection and correction of handwritten and printed Gujarati documents using linear regression. Skew detection and correction is important for any recognition system because it directly affects the recognition of characters and documents. The proposed method uses a linear regression formula to detect the angle of rotation a...

Publication date: 2013